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We present a structure of the ~650-kDa functional replisome of 
bacteriophage T7 assembled on DNA resembling a replication fork. 
A structure of the complex consisting of six domains of DNA heli- 
case, five domains of RNA primase, two DNA polymerases, and 
two thioredoxin (processivity factor) molecules was determined 
by single-particle cryo-electron microscopy. The two molecules of 
DNA polymerase adopt a different spatial arrangement at the repli¬ 
cation fork, reflecting their roles in leading- and lagging-strand syn¬ 
thesis. The structure, in combination with biochemical data, reveals 
molecular mechanisms for coordination of leading- and lagging- 
strand synthesis. Because mechanisms of DNA replication are highly 
conserved, the observations are relevant to other replication systems. 

cryo-EM structure | replisome | coordination of leading- and 
lagging-strands synthesis | DNA replication | DNA polymerase 

T he reverse polarity and antiparallel nature of the two strands 
in duplex DNA pose a topological problem for their simul¬ 
taneous synthesis. The “trombone” model of DNA replication 
postulates that the lagging strand forms a loop such that the 
leading- and lagging-strand replication proteins contact one an¬ 
other (1). This replication loop contains a nascent Okazaki 
fragment and allows for coordination of leading- and lagging- 
strand synthesis. The bacteriophage T7 replisome has been 
studied extensively (2). Thus, the macromolecular complex of T7 
replication proteins provides an excellent model for structural 
analysis of a replisome. Notably, the human mitochondrial and 
the T7 replication systems are similar (3). 

The phage T7 replisome is relatively simple. Four proteins are 
sufficient for reconstitution of the functional replisome, yet the 
assembled replisome recapitulates all of the key features of more 
complex prokaryotic and eukaryotic systems (2, 4). These pro¬ 
teins are the following: DNA polymerase (gp5) and its proc¬ 
essivity factor, Escherichia coli thioredoxin (trx), single-stranded 
DNA (ssDNA)-binding protein (gp2.5), and the bifunctional 
DNA primase-helicase (gp4). 

Gp4 forms multiple oligomeric forms (5). The negative stain 
EM revealed that hexamers and heptamers are the most abun¬ 
dant (~22% and ~78%, respectively). Upon addition of ssDNA, 
the equilibrium between the two oligomeric forms changes 
(~77% protein rings are now hexameric, and ~18% are hepta- 
meric), indicative of DNA binding (6). The hexamer is an en¬ 
zymatically active form of gp4. It binds to ssDNA, hydrolyzes 
nucleotides, translocates on ssDNA, and unwinds dsDNA (7-9). 
In contrast, the heptamer does not bind to ssDNA (6). 

Crystal structures of all of the individual T7 replication pro¬ 
teins have been determined (10-15). However, to date there is 
no structural information on the T7 replisome or its subassemblies. 
Consequently, intermolecular interactions involved in coordination 
of DNA synthesis have not been identified. We have therefore used 
cryo-electron microscopy (cryo-EM) and single-particle recon¬ 
struction methods to determine a structure of the T7 replisome. 

Results 

Replisome Assembly. We have identified conditions necessary to as¬ 
semble the T7 replication complex. A stable complex was obtained 


with genetically altered gp4 and gp5, trx, and DNA resembling a 
replication fork in buffer containing dTTP (Fig. L4). Gp4 contains 
an amino acid substitution, E343Q. Glutamate 343 functions as a 
catalytic base for hydrolysis of dTTP, an event that leads to trans¬ 
location on DNA. A crystal structure of the hexameric gp4 reveals 
that E343 is in position to activate the water molecule for a nu¬ 
cleophilic attack on the nucleoside y-phosphate (16). Consequently, 
this altered gp4 (gp4-E343Q) binds dTTP but does not catalyze 
hydrolysis on DNA to which it is tightly bound. The genetically 
altered gp5 contains two amino acid substitutions, D5A and E7A, in 
the active site of the 3'-5' exonuclease domain. The altered protein 
exhibits a polymerization activity similar to the activity of wild-type 
DNA polymerase, but its exonuclease activity is reduced by a factor 
of 10 6 (10). 

The DNA consists of a 77-nt oligonucleotide containing two 
self-complementary regions that anneal to form a tetra-loop and 
a 5-bp double-helical region in the middle of the sequence with a 
32-nt 5'-overhang and a 31-nt 3'-overhang (Fig. L4). These two 
overhangs mimic the lagging and the leading strand, respectively. 
Although the sequence is capable of forming intermolecular 
polymers as well as the desired intramolecular monomeric 
species, the monomeric form is favored by inclusion of the tetra- 
loop sequence. The tetra-loop forms a highly stable turn confor¬ 
mation only in the monomeric form that we previously confirmed 
by ID and 2D X H-NMR (17). We assessed the folding prop¬ 
erties of an oligonucleotide resembling a replication fork using 
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Fig-1- cryo-EM analysis and assembly of the replisome. (A) A fork-shaped DNA is formed by folding of ssDNA such that a stable tetra-loop is formed in the 
middle of the sequence flanked by a short double-helical region; two oligonucleotides are annealed to serve as primers. (B and C) Melting profiles of the 
tetra-loop DNA. Samples containing different concentrations of the fork-shaped DNA are represented by colored curved lines. Melting curves ( B ) and their 
first derivatives (C) show a biphasic transition with the melting temperatures (T m ) of 53 ± 2 °C. (D) Fluorescence anisotropy measurements of replisome 
formation. The replication complex assembled in the presence of gp4-E343Q is four times more stable than the complex assembled with wild-type gp4. 
Apparent binding constants are 0.2 ± 0.01 pM and 0.8 ± 0.05 pM, respectively. (£) A representative cryo-EM image. (Scale bar, 10 nm.) (E) The 2D class av¬ 
erages. (Scale bar, 10 nm.) (G) A calculated 3D map of the replisome. The initial model for the 3D calculation was generated using the atomic coordinates of 
the helicase domain of gp4 filtered to a resolution of 60 A. ( H ) Fourier Schell Correlation (FSC). The resolution of 13.8 A is based on the gold-standard FSC0.5 
criterion. (/) Comparison of reprojections from the 3D structure with the 2D class averages. (Scale bar, 10 nm.) 
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fluorescence melting spectroscopy in the presence of a fluores¬ 
cent dye that intercalates into double-stranded DNA. The de¬ 
crease in fluorescence as a function of increasing temperature 
suggests folding of the template and formation of the duplex 
(Fig. IB). The melting profiles show a biphasic transition in¬ 
dicative of formation of a single monomeric species in solution 
with a melting temperature of 53 ± 2 °C (Fig. 1C). In the control 
experiment, a similar DNA lacking a stable tetra-loop sequence 
does not exhibit a biphasic melting transition (Fig. IB). The 
5'-overhang of the folded template contains the primase recog¬ 
nition sequence 5-GGGTC-3'. The lagging-strand primer is a 
tetraribonucleotide complementary to the primase recognition 
sequence, and it has an additional two nucleotides at the 3'-end 
that base-pair with the template: a deoxyribonucleotide at position 

5 and a dideoxynucleotide at position 6. The dideoxynucleotide 
at the 3'-end of the primer prevents primer extension by gp5/ 
trx. The leading-strand primer is a 21-nt oligonucleotide, also 
containing a dideoxynucleotide at the 3'-end (Fig. L4). 

Upon addition of gp5/trx and gp4-E343Q to the fork DNA in 
the presence of dTTP, a stable replication complex is formed as 
evidenced by an increase in fluorescence anisotropy of a fluo¬ 
rescein-labeled template. Gp4-E343Q binds to the replication 
complex assembled on the DNA with an affinity fourfold higher 
than that measured for binding of gp4 (0.2 ± 0.01 pM and 0.8 ± 
0.05 pM, respectively), (Fig. ID). We assessed the binding of 
individual replication proteins to the fork DNA in a filter-bind¬ 
ing experiment, in which the leading-strand primer was radio- 
actively labeled at the 5'-end. Dissociation constants determined 
for gp5/trx and gp4-E343Q are 0.07 ± 0.03 pM and 0.28 ± 
0.05 pM, respectively (SI Appendix, Fig. SI). A stable replication 
complex of gp4-E343Q and gp5 is formed with K d of 0.2 ± 
0.04 pM, consistent with fluorescence anisotropy measurements. 
We have previously characterized gp4-gp5/trx-DNA interactions 
on the lagging and leading strand using a variety of biophysical 
methods (5, 18). The fork DNA was designed such that it com¬ 
bined DNA constructs used in these experiments. 

Preparation of the Cryo-Electron Microscopy Sample and Determination 
of the Replisome Structure. Detailed descriptions of the sample 
preparation protocols are included in Materials and Methods. 
First, the leading-strand primer and the template were annealed 
by incubation at 94 °C and by slowly decreasing the temper¬ 
ature to 25 °C. The 1.1-pM primer template was mixed with 

6 pM gp4-E343Q in a buffer containing 5 pM lagging-strand 
primer, 10 mM dTTP, and 0.001% wt/vol /z-dodecyl p-D-maltoside. 
Gp5/trx was added to the mixture after the 10-min incubation at 
25 °C. Samples of the replisome were adsorbed onto freshly glow- 
discharged holey carbon grids and flash-frozen in liquid ethane 
using a Vitrobot with a controlled temperature and humidity. The 
protein and DNA concentrations are higher than the measured 
dissociation constants (Fig. ID and SI Appendix, Fig. SI), and thus 
favorable conditions are created on the EM grid for complex 
formation. To confirm formation of the protein-DNA complex 
under these conditions, we performed a pull-down experiment 
with a biotin-labeled fork DNA. The 1 pM DNA was incubated 
with 2 pM gp4-E343Q and 10 pM gp5/trx in the buffer described 
above. Replisome samples were separated on streptavidin beads 
and analyzed by SDS/PAGE (SI Appendix, Fig. S2). The bands 
representing gp5/trx and gp4-E343Q on the gel confirm protein 
binding to the biotinylated fork DNA. 

The cryo-EM images (Fig. IE) were collected with a Titan 
Krios cryo-EM. Subsequent structure calculation and refinement 
steps were performed using single- and double-precision floating¬ 
point compilations of Relion-1.4. We used software and computa¬ 
tional infrastructure maintained by SBgrid (19), as well as high- 
performance computing facilities available through the Stampede 
Supercomputer at Texas Advanced Computing Center (20). 


The initial cryo-EM analysis revealed that ~95% of helicase 
rings are hexameric and ~5% are heptameric. Thus, the initial 
model for the 3D calculation was generated using the atomic 
coordinates of a hexameric helicase domain of gp4 [Protein 
Data Bank (PDB) ID code 1CR0] filtered to a resolution of 60 A 
(Fig. 1G). Fig. IF displays representative 2D class averages. We 
clearly see additional densities associated with the hexameric 
ring of DNA helicase likely representing DNA polymerases 
bound to the replisome. To confirm that these densities are gp5/trx 
complexes, we labeled thioredoxin with 5 nm Ni-NTA Nanogold 
(SI Appendix, Fig. S3). Thioredoxin was altered by addition of a 
poly-His tag at the N terminus. Gp5-trx complex was purified 
on Ni-NTA agarose (SI Appendix, Fig. S3A) and used to prepare 
samples for EM. Negative-stain EM reveals gold particles co¬ 
localizing with the replisome, confirming the presence of gp5/trx 
in the complex (SI Appendix, Fig. S3 B and C). We see one or two 
gold particles associated with the complex (SI Appendix, Fig. 
S3C). However, this labeling with gold is not quantitative, and 
gold particles are observed that are not attached to the complex. 

We also analyzed samples of the replisome in the absence of 
DNA using negative-stain EM. When DNA is not present, no 
binding between gp4-E343Q and gp5/trx is detected, indicat¬ 
ing that DNA is required for assembly of the functional gp4- 
E343Q-gp5/trx complex. In the absence of DNA, ~70% helicase 
rings are heptameric, and ~20% are hexameric, consistent with 
previous negative-stain EM analysis (5-8), native gel electro¬ 
phoresis (5, 6, 21), and gel-filtration data (9). 

Structural classification with RELION resulted in two 3D 
maps. One of these maps was then used for further refinements 
(Fig. 1G). The final 3D map was calculated using ~50% (79,519) 
particle images. The presented structure is thus representative of 
the entire dataset. The angular distribution plot is presented in 
SI Appendix, Fig. S5. A reported resolution of 13.8 A is based 
on the gold-standard 0.5FSC criterion (Fig. 1 H). The repro¬ 
jections from the structure are in good agreement with the 2D 
class averages (Fig. II and SI Appendix, Fig. S6). 

Well-defined electron densities allow for unambiguous fitting 
of two molecules of gp5/trx (PDB ID code 1T7P), six subunits of 
DNA helicase (PDB ID code 1CR0), and five of six subunits 
of RNA primase (PDB ID code 1NUI). Automatic docking of 
atomic coordinates into the cryo-EM map was performed with 
the Chimera software using the protocol described in SI Ap¬ 
pendix. The cross-correlation coefficient (CCC) calculated for 
simultaneous docking of all PDBs (i.e., two copies of PDB ID 
code 1T7P, five copies of PDB ID code 1NUI, and PDB ID code 
1CR0) is 0.91. The CCCs calculated for individual protein do¬ 
mains range from 0.83 to 0.99 and are presented in SI Appendix, 
Fig. S7. The local CCCs mapped onto the cryo-EM structure are 
presented in SI Appendix, Fig. S8. 

To determine the handedness in the map of the replisome, we 
performed an automatic docking of atomic coordinates into a 
mirror copy of the map across the “y-z” plane. The local CCCs 
calculated for a chiral copy of the replisome map are shown in SI 
Appendix, Fig. S8. The model presented in this article represents 
an enantiomer displaying a significantly better fit than its chiral 
counterpart (SI Appendix, Fig. S8). We also performed a random 
canonical tilt experiment (SI Appendix, Fig. S9). 

Architecture of the Replisome. An overall architecture of the T7 
replisome is presented in Fig. 2A and Movie SI. The structure 
reveals the presence of an asymmetric hexameric ring of DNA 
helicase to which two DNA polymerases bind in an asymmetric 
fashion. Likewise, the associated DNA primase domains of gp4 
are arranged asymmetrically (Fig. 2 B). Two molecules of gp5/trx 
are bound to the hexameric ring of gp4 at the back of the 
replisome relative to the direction of dsDNA unwinding by the 
helicase. Although DNA is not visible, the unique structures 
obtained suggest strongly its presence (Discussion). 
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Fig. 2. Architecture of the replisome. (A) The structure reveals the presence of the hexameric ring of DNA helicase (blue), five covalently linked DNA primase 
domains (orange), and two gp5/trx complexes. The polymerase domain (green) and the exonuclease domain (purple) of gp5 as well as thioredoxin (yellow) 
are indicated. The two molecules of DNA polymerase bind to gp4 in an asymmetric fashion, reflecting their roles in leading- and lagging-strand synthesis (gp5/ 
trxlead and gp5/trxlag, respectively). ( B ) An asymmetric arrangement of RNA primases at the replisome. 
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Asymmetric Positioning of DNA Polymerases at the Replisome. The 

cryo-EM structure provides evidence for the asymmetric positioning 
of the two DNA polymerases at the replisome. The two molecules 
of gp5/trx can be implicated in leading- and lagging-strand synthesis 
[gp5/trx lead and gp5/trx lag , respectively (Fig. 2D)]. The interaction 
between gp5/trx and gp4 differs on the lagging and leading 
strand. We previously reconstituted the gp4-gp5/trxi ead (5) and 
gp4-gp5/trx lag (18) complexes and characterized both interac¬ 
tions. The biochemical analysis is consistent with the replisome 
structure, and thus we can determine the location of gp5/trx lead 
and gp5/trxi ag (Fig. 2). A low-resolution structure of the gp4-gp5/ 
trxi ead complex has been determined by small-angle X-ray scat¬ 
tering (SAXS) (5). The position of gp4-gp5/trx lead in the SAXS 
structure is consistent with the position of one of the two gp5/trx 
molecules from the cryo-EM structure, confirming the assign¬ 
ment of gp5/trx lead . 

Interaction of the Leading-Strand DNA Polymerase with DNA Primase- 
Helicase. Gp5/trxi ead makes contacts with the two neighboring 
subunits of gp4 through the fingers and the exonuclease do¬ 
mains. The a-helix P and the region containing a loop connecting 
the a-helix P with the p-strand 10 (residues 566-586) from the 
fingers domain interact with a region of DNA helicase in close 
proximity to the a-helix (residues 402-416) (Fig. 3 A). Residues 
566-586 are missing in the crystal structure of the gp5/trx primer 
template (10), suggesting flexibility in this region. Interestingly, 
the loading patch of gp5 containing the basic amino acids K587, 
K589, R590, and R591 is located in close proximity to this region 
(SI Appendix , Fig. S10). We previously showed that this basic 
loading patch of gp5 interacts with the C-terminal acidic tail of 
gp4 and is essential for loading gp5 at the replication fork (22). 
The second interaction involves the primase domain from the 
adjacent subunit of gp4 and the region containing the a-helix E* 
from the exonuclease domain of gp5/trx lead . The a-helix E* is 
unique for gp5/trx in the Pol I family of DNA polymerases. 

In the crystal structure of the gp5/trx primer template, trx 
binds to a unique 76-amino-acid loop, the thioredoxin-binding 
domain (TBD), inserted in the thumb subdomain. The cryo-EM 
structure reveals that trx contacts the TBD in an extended con¬ 
formation. The electron density is slightly larger than a contour 
level of trx (Fig. 24), suggesting flexibility in this region, consistent 
with SAXS analysis of the trx-TBD interaction (23). A loop con¬ 
necting a-helices HI and H2 in the thumb domain makes contacts 
with the tip of the fingers, an interaction not observed in the crystal 
structure (Fig. 3 B). In summary, gp5/trx lead makes numerous in¬ 
teractions with gp4. The thumb and fingers of gp5/trx lead adopt a 
conformation similar to a closed conformation of the polymerase 
active site observed upon binding of gp5/trx to a primer template in 
the presence of an incoming nucleotide (Fig. 3 B and C) (10). 

Interaction of the Lagging-Strand DNA Polymerase with DNA Primase- 
Helicase. Gp5/trxi ag makes extensive contacts with two primase do¬ 
mains from adjacent subunits in gp4 through the fingers and the 
exonuclease domain (Fig. 3D). However, in contrast to gp5/trx lead , 
the polymearse active site of gp5/trxi ag adopts an open conformation 
(Fig. 3 E). In this open conformation, the fingers rotate outward 
away from the polymerase active site. The observed open confor¬ 
mation is consistent with the biochemical data indicating that DNA 
primers >21 nt are necessary for closure of the fingers (23) and 
supports the observation that the presence of the primase is re¬ 
quired for stabilization of shorter primers in the polymerase active 
site of gp5 (18). Interestingly, a loop connecting the a-helix P with 
the p-strand 10 and a-helices E* and F from the exonuclease do¬ 
main is also located in this interface, indicating that gp5/trx lag uses 
similar structural motifs to contact gp4 as does gp5/trxi ead . The TBD 
of gp5/trx lag displays an extended conformation. As in gp5/trx lead , a 
slightly larger electron density than a contour level of trx indicates 
flexibility in this region (Fig. 3 F). 


The Interaction Between the Leading- and Lagging-Strand DNA 
Polymerase. Gp5/trx lead and gp5/trx lag contact each other at the 
replisome. The protein-protein interface is formed by a region 
containing the a-helices G, Gl, and R from the palm domain of 
gp5/trxi ead and by the tip of the fingers domain of gp5/trxi ag (Fig. 
3G). Taken together, the 3D map of the ~650-kDa replisome 
permits an unambiguous fitting of the atomic coordinates of gp4, 
gp5/trxi ead , and gp5/trxi ag . 

Discussion 

DNA in the Replisome Structure. Although we do not directly ob¬ 
serve the density representing DNA in the map of the replisome, 
the conformation of the replication proteins (gp4-E343Q, gp5/ 
trxiead) is compatible with the one observed in the DNA-bound 
state (5-7, 9, 10). 

To visualize DNA in the replisome samples, we labeled a 
biotinylated fork DNA with quantum dots (QD) and visualized 
protein-DNA complexes by negative-stain EM. A biotin is at¬ 
tached to the tetra-loop sequence in the DNA template (Fig. 
L4). The DNA-QD conjugates were purified using magnetic 
beads coated with diethylaminoethyl cellulose (SI Appendix , Fig. 
S44). A micrograph presented in SI Appendix , Fig. S4 B shows 
QD-DNA conjugates that were negatively stained with uranyl 
formate in the absence of the replication proteins. Mixing of gp4 
and gp5/trx with QD-DNA results in formation of protein 
complexes that colocalize with QDs, indicative of DNA binding 
(SI Appendix, Fig. S4 C and D). This method is not quantitative, 
and QD-DNA is also observed distant from the replisome. 

DNA binding by the replisome is also supported by the fol¬ 
lowing observations: (i) Gp4 in the structure is hexameric. Gp4 
exists in multiple oligomeric forms, but only hexamers bind to 
ssDNA (5-7, 9, 21). In the absence of ssDNA, gp4 is predomi¬ 
nantly heptameric (6). (ii) No binding between gp4 and gp5/trx 
can be detected by negative-stain EM in the absence of DNA. 
(iii) We demonstrated DNA binding by the replisome using 
fluorescence anisotropy measurements with a fluorescein-labeled 
DNA (Fig. ID), filter-binding experiments with P 32 -labeled DNA 
(SI Appendix , Fig. SI), and a pull-down assay with a biotinylated 
fork DNA (SI Appendix , Fig. S2). In addition, the cryo-EM map of 
the replisome reveals a density located in the channel protruding 
through the center of the hexameric ring of DNA helicase that may 
represent DNA (Fig. 24). Previous EM and biochemical data in¬ 
dicate that ssDNA binds to gp4 in this channel (5, 8,14). The lack of 
the density representing DNA implies conformational heterogene¬ 
ity in the sample. For example, it is likely that ssDNA can be bound 
within the central channel of gp4 by different subsets of the six 
subunits of the helicase. In support of this notion, it is difficult to 
visualize the helicase-DNA complexes by cryo-EM even at high 
resolution (24). 

While our manuscript was under review, an article was pub¬ 
lished describing a low-resolution X-ray structure (4.8 A) of the 
gp4-gp5/trx complex in the absence of DNA (25). Consistent 
with our data, gp4 in this structure is heptameric. Three mole¬ 
cules of gp5/trx are associated with gp4 through the C-terminal 
tails of DNA helicase; the interaction described earlier used 
single-molecule methods (26, 27) and SAXS (5). 

Architecture of the Replication Fork. Upon analysis of the T7 replisome 
structure, in combination with the extensive biochemical and muta¬ 
tional studies reviewed in refs. 2 and 4, we propose a functional 
model of the replication fork (Fig. 44). DNA helicase contacts 
dsDNA and separates dsDNA into ssDNA. The two ssDNAs are 
bound by gp5/trxi ead and gp5/trxi ag . Both DNA polymerases are po¬ 
sitioned behind gp4 relative to the direction of dsDNA unwinding. 

The lagging-strand ssDNA passes through the central channel in 
the hexameric ring of DNA helicase (5, 8). The crystal structure of 
the hexameric gp4 reveals the presence of three loops protruding 
into the central cavity (14). These loops contain a number of basic 
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Fig. 3. Intermolecular interactions at the replisome. (A) Interaction of gp5/trxlead and gp4. ( B) The fingers of gp5/trxlead interact with the loop connecting 
a-helices HI and H2 in the thumb domain. (C) Gp5/trxlead adopts a closed conformation of the polymerase active site, in which the thumb and fingers are 
rotated inward. (D) In contrast, the polymearse active site of gp5/trxlag adopts an open conformation. In this open conformation, the fingers rotate outward 
away from the thumb. (£) Interaction between gp5/trxlag and gp4. (E) Trx binds to a unique 76-amino-acid loop, TBD, inserted in the thumb subdomain of 
gp5/trxlag. (G) Binding of gp5/trxlead and gp5/trxlag. The specific interactions are described in the text. 
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Fig. 4. Architecture of the replication fork and a model for formation of the lagging-strand replication loop. (A) A model of the replication fork before 
formation of the replication loop on the lagging strand. ( B ) Polymerization of nucleotides by gp5/trxlag combined with the movement of ssDNA through the 
central cavity of gp4 accounts for formation of the replication loop on the lagging strand. 


residues involved in DNA binding (16). Hydrolysis of dTTP by gp4 
is coupled with a sequential transfer of ssDNA by the DNA- 
binding loops from one subunit to the adjacent subunit within 
the central channel (14, 16). ssDNA exiting from the channel has 
the correct polarity for binding by the primase and the DNA- 
binding cleft of gp5/trx lag . 

Displacement of the leading-strand ssDNA by the C-terminal 
acidic surface of gp4 allows for binding of ssDNA by the positively 
charged active site of gp5/trxi ead . In the model, a primer template 
bound to gp5/trx lead (PDB #1T7P) is located such that the extension 
of the 5'-end of the template would position it in close proximity to 
the subunit interface p-hairpin containing residue F523 of gp4 (SI 


Appendix , Fig. Sll). We showed earlier that F523 interacts di¬ 
rectly with the extruded DNA leading strand (21). The orien¬ 
tation of gp5/trx lead at the replisome helps to stabilize partially 
unwound dsDNA at the junction between ssDNA and dsDNA, 
thus helping to separate dsDNA. Additionally, gp5/trxi ead bind¬ 
ing to gp4 can change the equilibrium between the forward and 
the backward Brownian sliding of DNA helicase along ssDNA 
(28, 29), favoring the movement of gp4 in the 5'-3' direction 
or sliding of the lagging-strand ssDNA through the central 
channel of gp4 in the 3'-5' direction. 

The surface of the replisome is negatively charged with the 
exception of regions implicated in DNA binding (SI Appendix , 
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Fig. S10). These regions have strong electropositive potential, 
namely, the central channel in the hexameric ring of gp4, gp5/trx 
active sites, loading and exchange patches located on gp5/trx, the 
primase domains, and trx. All these regions make contacts with 
DNA in the model. 

Our model differs from the recently proposed model for the 
Saccharomyces cerevisiae replication fork based on a low-resolution 
cryo-EM map (30) of the complex between CMG helicase and the 
leading-strand DNA polymerase epsilon (Pol s). The model pro¬ 
posed by Sun et al. (30) suggests that Pol s is positioned ahead of 
the replisome, possibly functioning as the prow for unwinding of 
dsDNA. In contrast, in our model gp5/trx lead is positioned behind 
the replisome relative to the direction of dsDNA unwinding. In¬ 
consistencies in the two models may reflect differences between 
prokaryotic and eukaryotic replication systems. 

Formation of the Lagging-Strand Replication Loop. The loop on the 
lagging strand implies changes in the orientation of gp5/trx lag . In 
this new orientation, both leading- and lagging-strand synthesis 
occur at a single point and proceed in the same direction. How 
the replication loop is formed is not known. The geometry of 
binding between gp5/trx lag and gp4 in the cryo-EM structure, 
however, does suggest a mechanism. The interaction of gp5/trxi ag 
with the primase domains of gp4 positions gp5/trxi ag such that its 
polymerization activity pushes the newly synthesized dsDNA 
toward the central channel of gp4 (Fig. 4 B). Consequently, both 
the polymerization activity of gp5/trxi ag combined with the move¬ 
ment of ssDNA through the central cavity of the helicase account 
for formation of the replication loop on the lagging strand. 

Coordination of Leading- and Lagging-Strand Synthesis. Coordina¬ 
tion of leading- and lagging-strand synthesis is critical for high 
processivity of the replisome (5, 31-36). However, the enzymatic 
events on the lagging and leading strands are not equivalent. The 
lagging-strand DNA polymerase repeatedly dissociates from 
DNA and recycles to a new primer upon completion of each 
Okazaki fragment. Furthermore, one of the slowest events on the 
lagging strand is the recognition of the primase recognition se¬ 
quence, synthesis of oligoribonucleotides by the primase, and 
transfer to gp5/trx lag , which, in the time required for these events, 
would lead to accumulation of approximately 1,000 nucleotides 
of leading-strand DNA. Why does the synthesis of the leading 
strand not outpace the synthesis of the lagging strand? 

Two models have been proposed to achieve coordination of 
leading- and lagging-strand synthesis. In the first model, leading- 
strand synthesis transiently stops allowing the completion of the 
slow enzymatic events on the lagging strand. The 12-s pause al¬ 
lows for the relatively slow process of primer synthesis (31, 32). 
In the second model, coordination is achieved by different syn¬ 
thesis rates of the leading and the lagging strands. Contrary to 
the results described by Lee et al. (32), the studies of the T7 
replisome by Pandey et al. (37) indicated that lagging-strand 
DNA synthesis is 38% faster than leading-strand DNA synthesis. 
In addition, no pausing of replisome movement was detected 
during primer synthesis. 

The different binding modes of gp5/trxi ag and gp5/trxi ead to gp4 
are compatible with the two polymerases synthesizing DNA at 
different rates. The geometry of binding between gp5/trx lead and 
gp4 indicates that the rate of leading-strand synthesis is limited 
by the rate of dsDNA unwinding by the helicase (28). DNA 
synthesis by gp5/trx lag is not limited by its binding to gp4, but 
rather by the availability of ssDNA from the replication loop 
(Fig. 4 B). 

A direct interaction between gp5/trxi ag and gp5/trxi ead at the 
replisome provides a mechanism for coordination of the leading- 
and lagging-strand synthesis. Conformational changes in a structure 
of DNA polymerase (i.e., transitions between open and closed 
conformations of the polymerase active site) resulting from disso¬ 


ciation/association to the primer template offer a sensing mecha¬ 
nism, in which changes on one strand are transmitted to the other 
strand via a protein-protein interaction. For example, an inward 
rotation of the fingers in gp5/trxi ag resulting from the engagement of 
the primer by gp5/trxi ag , and a subsequent extension of the primer to 
an Okazaki fragment, would lead to disruption of the contact be¬ 
tween gp5/trxi ag and gp5/trxi ead . However, upon completion of the 
synthesis of an Okazaki fragment, gp5/trxi ag disengages from the 
primer, an event manifested by the outward rotation of the fingers. 
The fingers of gp5/trx lag are then in contact with gp5/trx lead , pro¬ 
viding an interaction that could modulate leading-strand synthesis 
(Fig. 3G). The geometry of binding between gp5/trxi ead and gp5/ 
trxi ag excludes a possibility to form a similar interaction with a third 
copy of gp5/trx at the replication fork, explaining why only two 
DNA polymerases can be actively engaged in a coordinated DNA 
synthesis. Interestingly, T4 DNA polymerase forms dimers. The 
fingers of T4 DNA polymerase are also involved in mediating the 
protein-protein interaction (38), suggesting an analogous mecha¬ 
nism for coordination of DNA synthesis in the phage T4 repli¬ 
cation system. 

Although absent in the replisome structure, the ssDNA-binding 
protein (gp2.5) encoded by T7 is essential for coordination of the 
synthesis of the two strands (39). In its absence the resulting 
Okazaki fragments are considerably shorter than those synthe¬ 
sized in its presence (39). Gp2.5 stimulates activities of gp4 and 
gp5 during coordinated synthesis of DNA (39). Gp2.5 increases 
the frequency of initiation of lagging-strand synthesis greater 
than 10-fold (4). The acidic 26-residue C-terminal tail of gp2.5 
interacts with the basic loading patch of gp5/trxi ag (26), the re¬ 
gion where the acidic C-terminal tail of DNA helicase also binds. 
A competition between these two overlapping interactions dur¬ 
ing coordinated synthesis provides a switching mechanism to 
release gp5/trx lead from the replisome. Gp2.5 binding to the basic 
loading patch of gp5/trx lag , located in the hinge region connect¬ 
ing the palm domain and the fingers, may affect the inward and 
outward rotation of the fingers. Consequently, gp2.5 can mod¬ 
ulate binding of the fingers of gp5/trxi ag with the palm domain of 
gp5/trxi ead , the interaction important for coordination of leading- 
and lagging-strand synthesis. Because the basic loading patch of 
gp5/trxi ag is located next to the region involved in stabilization of 
gp5/trx lag on the replisome (SI Appendix, Fig. S10), gp2.5 binding 
could also regulate release of the replication loop. 

In summary, the structure of the replisome reveals multiple 
interactions between DNA polymerase, DNA helicase, and RNA 
primase. Spatial topology of binding between the individual rep¬ 
lication proteins provides a structural framework that brings to¬ 
gether multiple enzymatic activities that orchestrate coordinated 
DNA synthesis. 

Materials and Methods 

A more detailed description of materials and methods used in this work can 
be found in SI Appendix, Supplementary Materials and Methods. 

Recombinant Proteins. Proteins were overproduced and purified as described 
previously: wild-type gp4 (40), gp4-E343Q (16), gp5/trx complex in which two 
amino acids in the exonuclease domain of gp5 are replaced with alanine: 
D5A and E7A (10). 

cryo-EM. Samples of the replisome were adsorbed onto freshly glow-dis¬ 
charged holey carbon grids (Quantifoil R1.2/1.3, Cu 400 mesh) and flash- 
frozen in liquid ethane using a Vitrobot with a controlled temperature and 
humidity. Data were acquired with a Titan Krios cryo-transmission electron mi¬ 
croscope (FEI) at Aarhus University, operating at 300 kV and a dose of <20 e~ A' 2 
using the program for the automated data collection Leginon. Images were 
collected using a GATAN 4k x 4k pixel CCD camera. A total of 3,995 images were 
recorded at a defocus of 2-3 \im and a magnification of 50,000 times corre¬ 
sponding to 1.1 9-A pixel size. Contrast transfer function parameters were esti¬ 
mated with ctffind4. All subsequent data-processing steps were performed using 
single- and double-precision floating-point compilations of Relion-1.4 (41). 
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